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SUMMARY 

A method  is  described  for  creating  computer  programs  to 
analyse  text  for  compliance  with  a specified  structure; 
examples  of  such  programs  are  command  decoders  and  format 
checkers.  The  paper  shows  how  analysis  programs  are  easily 
produced  by  constructing  a table  directly  from  a specification 
of  the  intended  data  format  and  presenting  this  as  input  to  a 
non-specific  parser  subroutine  which  has  been  written  and  is 
described.  Facilities  are  included  to  execute  user-written 
subroutines  when  key  structures  are  recognised  in  the  input. 

Programs  have  been  written  in  PDP-11  and  NOVA  assembly 
language  to  implement  the  parser,  and  instructions  for  use  of 
these  programs,  examples  and  listings  are  included. 
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1 . INTRODUCTION 

A commonly  occurring  requirement  in  assembly  language  programming  is  analysis 
of  input  text  strings  to  implement  activities  described  by  that  input.  Examples 
are  typed  commands  to  a real-time  program  or  mnemonic-encoded  instructions  to  an 
assembler.  Programmers  tend  to  solve  this  problem  by  writing  a decoder  routine 
specifically  tailored  to  the  expected  format;  if  it  is  permissible  in  the 
application,  the  format  is  usually  compressed  to  be  as  cryptic  as  possible  to 
simplify  this  task. 

Another  solution  is  to  use  a general-purpose  decoder  routine  which  operates 
on  a coded  description  of  the  format  expected  in  the  data  to  detect  the  presence 
of  salient  structures  and  direct  implementation  of  appropriate  actions.  This 
method  allows  the  programmer  to  concentrate  on  the  description  of  the  data  format 
and  the  execution  of  the  actions  invoked  by  the  input,  and  results  in  easier 
writing  and  debugging.  In  addition,  a rich  and  natural  command  format  is 
encouraged  and  expansion  is  easy  to  accommodate. 

The  code  produced  by  using  a table-driven  parser  is  often  very  similar  in 
execution  to  an  ad  hoc  decoder,  while  being  far  easier  to  understand.  The 
features  included  in  the  parser  can  be  tailored  to  suit  the  size  and  complexity 
of  the  job;  whilst  some  overhead  is  inevitable,  for  large  or  complex  formats 
the  efficiency  of  the  code  produced  by  the  two  techniques  is  comparable. 

The  routines  described  here  were  originally  developed  to  decode  keyboard 
commands  on  a real-time  data  logger  and  have  since  been  used  for  text  format 
verification.  They  are  principally  designed  to  recognize  structures  encountered 
in  command  strings,  but  other  structures  are  easily  included. 


2 . THEORY 


2.1  Preliminaries 

A good  way  to  describe  an  artificial  language  of  the  kind  commonly  used  in 
computing  is  by  a phrase-structure  grammar(ref . Ij . This  technique  consists 
of  grouping  together  basic  structural  elements  to  form  more  complex  items, 
which  in  turn  are  grouped  to  build  up  the  language.  An  example  is  a natural 
language  like  English,  where  letters  are  grouped  to  form  words,  words  to  make 
phrases,  clauses  etc.  and  these  parts  to  make  sentences.  The  grammar  is  the 
set  of  rules  which  defines  how  such  combinations  are  made,  i.e.  the  rules 
describe  the  syntax  of  the  language. 

It  is  also  necessary  to  use  a language  to  describe  the  grammar;  some  way 
must  be  found  to  set  out  the  rules  of  syntax.  This  can  be  done  in  English, 
but  problems  of  inexactness  of  meaning  arise  with  such  a rich  language.  A 
simpler  and  more  formal  language  (a  meta- language)  is  preferable  for  such 
description.  Backus-Naur  Form  (BNF)  is  a meta- language  which  was  developed 
to  describe  syntactic  rules  unambiguously.  A form  of  BNF  will  be  used  here; 
a summary  of  the  elements  of  BNF  is  included  in  Appendix  I (ref. 2). 

2.2  Table-driven  parsers 

The  process  of  examining  a data  sample  to  determine  whether  it  conforms 
with  a particular  grammar  is  parsing.  A simple  strategy  for  designing  a 
parser  program  is  to  develop  a table  which  directs  a set  of  general  routines 
to  carry  out  the  parse;  the  table  is  constructed  from  the  syntax  rules  of 
the  grammar  being  analysed.  This  is  a table-driven  parser;  the  table 
specifies  the  transitions  or  changes  which  occur  between  states  or  elements 
within  the  grammar.  The  table  is  a transition  table  or  state  table(ref . 3) . 

The  parsing  procedure  consists  of  starting  at  the  top  or  head  state  of  the 
transition  table  and  comparing  the  first  element  of  the  input  with  the 
syntactic  types  which  identify  the  columns  in  the  table.  Where  a match 
occurs,  a transition  is  specified  by  a pointer  in  the  table  to  the  next  state 
or  row.  This  process  continues  until  a transition  is  found  to  a special 
state  indicating  success  or  failure.  This  is  best  seen  by  example. 
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2.3  Constructing  a transition  table 

A language  is  described  below  in  BNF(ref.3).  It  consists  of  simple 
arithmetic  assignment  statements.  The  elements  which  make  up  the  language 
are  the  meta- components  <letter>,  •^ull>,  =,  and  <bperator>  as  defined  from 
basic  objects  in  Appendix  I. 

■sassignment  statement>  ^ = <letter>  <rest  of  a.s.> 

<rest  of  a.s.>  - = <«xpression> 

<expressior^  = <letteT>  <rest  of  exp.> 

<rest  of  exp.>  - = <5iull>  I <bperator>  <«xpression> 


This  grammar  allows  such  constructs  as  A = B + C and  D = A * G - D. 
The  transition  table  constructed  directly  from  this  grammar  is: 


State 

f 

<'letter> 

Meta  component 

<bperator>  = 

■^ull> 

1. 

•^assignment  statement^ 

2 

5 

5 

5 

2. 

•<rest  of  a.s.> 

5 

5 

3 

5 

3. 

<expression> 

4 

5 

5 

5 

4. 

<rest  of  exp.> 

5 

3 

5 

6 

5. 

ERROR 

6. 

SUCCESS 

The 

table  is  built  by 

considering  each  definition  in 

turn. 

If  the  first 

meta- component  on  the 

right-hand  side 

is  found,  the 

second 

must  be  found 

next  so  it  becomes  the  target  for  the  next  stage  of  the  parse;  thus  a 
pointer  is  entered  in  the  table,  specifying  a transition  to  that  state. 

The  parse  process  for  the  statement  A = B proceeds  as  follows: 

(a)  Entering  at  state  1,  the  first  input  element  A matches  <letter>  and  a 
transition  occurs  to  state  2.  Anything  which  did  not  start  with  a 
letter,  e.g.  the  statement  - B = C,  would  be  another  object  and  fail 
the  parse. 

(b)  In  state  2,  the  = causes  a transition  to  state  3;  statements  like 
A + B = C would  fail. 

(c)  B matches  <letter>  in  state  3 and  a transition  occurs  to  state  4 
where  •^ull>  causes  success.  Here  A = BC  would  fail,  but  A = B + C 
would  proceed  because  the  + would  match  an  *^perator>  and  a transition 
to  state  3 would  occur. 

2.4  Language  restriction 

The  above  language  definition  could  be  simplified  if  the  first  state  was 
changed  to: 

<assignraent  statement>  ••=  <letter>  = <«xpression> 

However  the  transition  table  cannot  handle  this  structure.  The  intermediate 
state  <rest  of  a.s.>was  necessary  so  that  in  the  right-hand  side  of  the 
definition,  a single  object  is  followed  by  a single  meta- component . This 
ensures  that  at  any  stage  in  the  parse  no  backtracking  is  necessary  if  the 
route  being  followed  fails(ref .3) . Parsing  is  then  fast  and  efficient  at 

the  cost  of  restricting  the  range  of  grammars  which  can  be  handled. 

Application  of  this  rule  will  produce  a right  linear  regular  grammar,  i.e. 
a finite  state  (Chomsky  type  3)  grammar(ref . 1) . Simple  precedence  languages 

are  produced  by  finite  state  grammars (ref . 3)  and  these  ensure  a unique  parse 
tree  through  the  grammar  for  any  input.  Every  finite  language  can  be 
generated  by  a regular  grammar(ref . 1) , and  the  majority  of  languages  used  in 
computing  are  finite  (i.e.  consist  of  a finite  number  of  combination  of  elements). 
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A program  to  analyse  according  to  a regular  grammar  is  a finite-state 
parser. 

2.5  Subsidiary  transition  tables 

A situation  can  occur  where  a particular  meta- component  constructed  in  the 
grammar  could  be  used  as  a pseudo-object  within  the  grammar.  An  example  of 
this  occurs  if  the  example  of  Section  2.3  is  modified  to  require  three  letter 
variable  names.  Then  the  BNF  definition  becomes: 


^assignment  statement> 
<rest  of  a.s.l> 

<rest  of  a.s.2> 

<rest  of  a.s.> 
<expression> 

<rest  of  exp.  1> 

<rest  of  exp.  2> 


= <letter>  <rest  of  a.s.l> 

= <letter>  <rest  of  a.s.2> 

= <letter>  <rest  of  a.s> 

= <5xpression> 

= <letter>  ^rest  of  exp.  1> 

= <letter>  <i‘est  of  exp.  2> 

= <letter>  <rest  of  exp.> 

= <^ull>  I <operator>  <«xpression> 


<rest  of  exp.>  ••=  <^ull>  I <operator>  ^xpression> 

The  same  kind  of  structure  has  been  invoked  each  time  the  variable  name 
appears.  It  would  be  more  efficient  to  define  a pseudo-object  in  a subsidiary 
transition  table  which  can  then  be  used  in  the  main  table  as  if  it  were  an 
object.  A successful  parse  in  the  subsidiary  table  would  be  equivalent  to 
matching  the  object.  The  effect  of  the  subexpression  structure  is  similar 
to  the  subroutine  concept  in  programming.  The  BNF  definition  and  transition 
tables  for  the  example  would  consist  of  a main  table  and  definition  as  in 
Section  2.3,  but  with  <letter>  replaced  by  <hame>,  and  the  subsidiary 
structures: 


<hame> 

*^nore  name> 
•^est  of  name> 


= <letter>  '^ore  name> 

= ■^etter>  <^est  of  name> 


:=  <letter> 


<letter> 


<bperator> 


<iiull> 


1 . <^ame>  2 5 5 5 

2.  ■^ore  name>  3 5 5 5 

3.  <rest  of  name>  4 5 5 5 

4 . RETURN 

5 . ERROR 

In  this  particular  example  another  solution  could  have  been  found  without 
using  the  subexpression  structure.  If  the  definition  of  a variable  name 
could  be  changed  to  be  one  or  more  letters  unrestricted  in  length,  then 
powerful  recursive  definitions  can  be  used  thus: 

<assignment  statement>  "=  <letter>  ^srest  of  a.s.> 


<assignment  statement> 
^rest  of  a.s.> 
<expression> 

<rest  of  exp.> 


<letter>  <rest  of  a.s>  I = <expression> 
<letter>  <rest  of  exp.> 

<letter>  <^est  of  exp.>  I <hull>  I <operator> 

<expression> 


2.6  Semantics 


The  parser  merely  determines  whether  or  not  the  input  is  syntactically 
correct.  Its  only  output  is  logical  - the  sentence  parsed  or  it  did  not. 
Normally,  the  input  is  intended  to  trigger  some  operation  within  the  program 
depending  on  its  content;  that  is,  the  input  has  meaning  or  semantics  in  the 
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context  of  the  program,  and  it  is  examined  in  order  to  decode  the  semantics. 
Semantic  structure  is  much  more  difficult  to  formalise  and  describe.  How- 
ever, in  a practical  table-driven  parser,  it  is  possible  to  allow  execution 
of  subroutines  each  time  a transition  occurs.  This  permits  the  programmer 
to  execute  code  in  his  program  at  key  points  during  the  parse  when  particular 
structures  are  recognized,  and  this  is  generally  adequate  for  implementing 
semantics . 


3.  IMPLEMENTATION 


3.1  Discussion 

In  order  to  implement  a finite-state  parser  on  a computer,  it  is  necessary 
to  have  a set  of  routines  each  of  which  recognizes  one  object  of  the  language, 
a formalism  for  storing  the  transition  table  and  an  executive  to  direct  the 
matching  and  transition  process.  In  practice,  some  extensions  permit  greater 
efficiency (ref . 4) . 

3.2  Transition  table  storage 

The  transition  table  will  be  stored  in  a computer  as  a list  because  of  the 
sequential  nature  of  computer  memory.  This  is  easily  accon5)lished  and  a 
number  of  redundant  transitions  removed,  by  altering  the  table  construction. 

A list  can  be  made  consisting  of  a series  of  cells,  one  for  each  transition. 
The  first  entry  of  each  csll  names  the  meta- component  or  object  to  be  matched 
and  is  equivalent  to  the  column  heading  in  the  table;  the  next  entry  points 
to  the  target  state  for  the  transition.  Furthermore,  all  meta-components 
which  are  not  allowed  in  a particular  state,  i.e.  will  produce  an  error,  can 
be  designated  by  the  pseudo-object  <anything  else>.  Then  the  transition 
table  for  the  example  of  Section  2.3  is  as  shown  in  figure  1(a). 

Normally  symbolic  labels  are  used  in  assembly  language,  and  thus  as  state 
pointers.  The  object  <anything  else>  always  points  to  the  ERROR  exit  so 
this  cell  can  be  contracted  to  a single  "end  of  state"  code  which  is 
recognized  by  the  parser  executive.  Tlie  final  list  for  the  example  is 
smaller  than  the  original  table  (figure  1(b)).  IVhen  many  objects  exist  in 
the  language  the  improvement  is  substantial. 

3.3  Pseudo-objects 

In  most  computing  applications,  structures  like  words  and  numbers  are 
needed  as  basic  elements  along  with  individual  ASCII  characters  like  key 
letters  or  punctuation  marks.  Conceptually  these  applications  can  be 
satisfied  by  defining  the  entire  character  set  as  objects  and  constructing 
words  and  numbers  in  subsidiary  tables  as  explained  in  Section  2.5.  However, 
it  is  more  efficient  to  include  words  and  numbers  as  pseudo-objects  by 
providing  routines  in  the  parser  to  recognize  them  directly.  The  objects 
which  this  parser  recognizes  include  the  individual  characters  of  the  ASCII 
set,  any  arbitrary  character,  specific  words  of  any  length,  any  arbitrary 
letter  string,  integer  numbers,  and  end-of-line  marks.  Other  objects  can 
be  added  by  users  who  write  the  appropriate  recognition  routines. 

3.4  Action  routines 

A facility  to  execute  user-written  subroutines  is  provided  by  expanding 
the  cell  in  the  transition  table  to  include  a pointer  to  a subroutine  which 
is  executed  if  the  transition  is  taken.  The  parser  includes  a null  sub- 
routine which  does  nothing;  then  in  those  transitions  where  no  action  is 
needed  a pointer  to  this  subroutine  is  placed.  Then  the  cell  of  the  trans- 
ition table  list  is: 


(state  label:)  code  indicating  type  of  object  to  match 


(any  trailing  arguments) 

pointer  to  action  subroutine  if  match  succeeds 
pointer  to  target  state  if  match  succeeds 


end  of  state  code 


3.5  Defaults 

A useful  feature  arises  if  another  pseudo-object  is  defined  which  always 
succeeds  so  that  its  action  routine  and  transition  are  always  implemented. 

Its  usefulness  lies  in  its  ability  to  allow  defaults  in  the  input  string. 

For  example,  a command  structure  might  be  decoded  to  determine  user-entered 

parameters;  if  a particular  parameter  is  missing,  the  test  for  it  would 

fail  but  by  following  this  with  the  matching  object  the  parse  can  proceed  to 

the  next  parameter,  leaving  an  initialised  default  value.  A specific  example  I 

of  this  is  set  out  in  Appendix  II.  .] 

3.6  Construction 

The  parser  is  written  as  a subroutine  which  is  called  from  the  user's  main 
program.  The  objects  or  syntactic  types  which  are  recognized  are  defined  as 
numeric  codes  for  constructing  the  transition  table.  Pointers  to  this  table 
and  the  text  string  to  be  analysed  are  passed  in  the  subroutine  call. 

Analysis  proceeds  in  the  parser  by  using  the  object  codes  to  vector  to  a 
routine  to  match  that  object;  if  the  match  occurs  a success  return  executes 
the  action  subroutine  for  the  transition,  then  selects  the  target  state  to 
find  the  next  object  code.  If  the  match  fails,  a failure  return  skips  the 
action  routine  and  target  pointers  to  select  the  object  from  the  following 
transition.  If  the  end-of-state  code  is  found,  control  returns  to  the 
calling  program  with  an  error  code;  if  a success  state  is  entered,  a success 
code  is  returned.  If  the  parse  failed  a pointer  to  the  place  in  the  input 
where  failure  occurred  is  available. 


4.  HOW  TO  USE  THE  PARSER 
4.1  Calling  sequence  for  PDP-11 

The  subroutine  call  'JSR  PC,  PARSE'  initiates  parsing.  Register  R0  will 
contain  the  result  of  the  parse  on  return  - R0  = 0 if  successful,  = -2  if 
failed.  Registers  R3  to  R6  are  unaffected  by  the  parse,  although  one  word 
of  stack  space  is  temporarily  used  for  each  level  of  subexpression  evaluation. 

The  parser  returns  to  the  caller  by  'RTS  PC'. 

When  the  call  occurs,  R1  must  point  to  the  first  byte  in  a byte  array  of 
7-bit  ASCII  text  to  be  analysed.  If  the  parse  fails,  R1  will  point  to  the 
first  unrecognized  byte.  Register  R2  must  point  to  the  head  state  of  the 
transition  table  defining  the  grammar  which  the  text  must  obey.  The  cells 
of  this  table  have  the  format 

syntactic  type  code  j 

any  trailing  arguments  j 

address  of  action  subroutine  (pointer  for  indirect  jump)  j 


address  of  target  state 

The  end-of  state  code  is  -1.  The  address  of  the  "do  nothing"  subroutine  is 


available  by  using  the  symbol  "NULL."  The  parser  executes  action  subroutines 


by  'JSR  PC,  @ POINTER'  so  they  must  resume  the  parse  by  'RTS  PC'.  Action 
subroutines  must  preserve  registers  R1  and  R2  and  tidy  the  stack  if  used. 

The  parser  is  assembled  into  its  own  program  section  to  avoid  symbol  clash. 
The  symbols  'NULL.'  and  'PARSE'  are  declared  global,  as  are  'NUM.'  and  'CHAR.' 
(see  Section  4.3). 

4.2  Calling  sequence  for  NOVA 

The  instruction  'JSR  PARSE'  will  initiate  parsing.  This  passes  the  return 
address  to  the  parser  in  ACS.  Accumulator  ACl  must  hold  a word  pointer  to 
the  text  string  to  be  analysed,  stored  as  7 bit  ASCII  bytes  in  .TXTM  1,  i.e. 
the  first  character  in  the  top  byte  (MSB's)  then  the  second  in  the  low  byte. 

AC2  must  contain  a pointer  to  the  head  state  of  the  transition  table  which 
has  the  format  for  each  cell 

syntactic  type  code 

any  trailing  arguments 

address  of  action  subroutine 

address  of  target  state 

The  end-of-state  code  is  -1.  The  user  must  define  page  zero  locations  for 
CHAR,  and  NUM.  and  the  mask  variable  IDMSK  containing  377  octal.  The  parser 
uses  locations  20,  21  and  30.  Auto  increment  register  20  is  assigned  the 
label  STPT  by  the  parser  and  is  used  to  hold  the  pointer  to  the  state  table; 

21  is  TXPT  and  points  to  the  text  string  word  by  word,  and  30  is  STACK  and 
indexes  a 10-word  stack  which  allows  5 levels  of  nesting  of  subexpressions. 

After  parsing,  AC0  carries  back  a result  code  to  the  caller.  If  a 
successful  parse  occurred,  AC0  =0;  if  i "ailed,  AC0  = -1.  If  the  parse 
fails,  the  page  zero  register  21  (TXPT)  is  pointing  to  the  word  containing 
the  byte  which  caused  the  failure,  except  that  if  the  first  byte  in  the 
string  fails  TXPT  will  point  one  word  before  it  because  of  auto  increment 
management . 

Action  routines  are  called  as  subroutines  via  'JSR  @ POINTER'  so  they 
must  resume  the  parse  by  'JMP  0,  3'  or  similar  arrangement.  No  other 
accumulators  need  preservation  by  routines.  The  'do-nothing'  subroutine 
provided  by  the  parser  is  at  address  'NULL.'. 

4.3  Syntactic  types 

The  following  entries  indicate  the  octal  number  code  for  each  syntactic 
type  with  the  symbols  assigned  to  them  by  the  parser  in  definition  statements, 
plus  a brief  description  of  the  objects  they  represent. 

4.3.1  ALPH.  = 1 

This  matches  all  following  alphabetic  characters  until  a non- 
alphabetic  character  is  found.  It  amounts  to  an  "ignore  text" 
specification,  allowing  optional  extensions  on  key  symbols,  etc. 

The  entry  has  a trailing  one-word  argument.  The  low  byte  is  a 
maximum  byte  cotint  to  limit  the  amount  of  text  to  be  ignored,  with 
0 corresponding  to  the  largest  limit  (256  characters).  The  high 
byte  is  a flag  indicating  whether  spaces  and  tabs  count  as  alphabetic 
or  non-alphabetic  characters;  flag  = 0 if  the  former,  1 if  the  latter. 
If  flag  = 1,  the  match  will  cease  if  a space  or  tab  is  found. 

This  type  only  fails  if  the  character  count  is  exceeded.  Note  that 
it  succeeds  if  there  are  no  alphabetic  characters,  since  its  effect 
is  to  push  the  text  pointer  on  to  the  next  non-alphabetic  character, 
which  can  be  the  next  byte. 

4.3.2  NUMB.  = 2 

This  matches  a decimal  integer  whose  value  is  stored  in  the  parser- 
defined  location  'NUM.'  (global)  in  a PDP-11  or  'NUM.'  (page  zero)  in 
a NOVA.  Conversion  stops  at  the  first  non-numeric  character.  It 
always  succeeds  - if  no  numerals  are  present,  NUM.  = 0. 
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4.3.3  KEY.  = 3 

Key  words  are  one  or  more  specific  ASCII  characters,  e.g.  "NAME" 
or  These  follow  the  type  specification  in  the  transition 

table  as  trailing  arguments,  and  terminate  with  a null  byte.  The 
NOVA  assembler  will  correctly  store  them  if  in  .TXTM  1 mode  the  .TXT 
pseudo-operator  is  used.  The  PDP-11  MACRO  assembler  provides  a 
.ASCIZ  pseudo-operator;  the  .EVEN  pseudo-operator  must  be  used 
after  the  string  to  return  to  word  boundaries.  For  single  character 
keywords,  a literal  argument  is  possible  - for  PDP-11  use '. BYTE  'X,0' 
and  for  NOVA  ’256.*"X' . 

4.3.4  EOF.  = 4 

Equivalent  to  the  null  operator  in  the  examples  of  Section  2.3, 
this  matches  ESC,  ALT  MODE,  FORM  FEED,  RUBOUT,  CR  codes.  On  a 
PDP-11,  if  a CR  is  found  and  it  is  trailed  by  a LF,  the  LF  is  also 
matched  and  the  text  pointer  will  indicate  the  character  after  the 
LF.  The  particular  terminator  found  is  available  to  the  action  sub- 
routine in  register  R3.  In  a NOVA  the  particular  character  is  in 
AC0. 

4.3.5  MTCH.  = 5 

This  always  succeeds.  It  does  not  move  the  text  pointer.  It 
forces  action  and/or  a new  state. 

4.3.6  ANY.  = 6 

This  puts  the  next  character  into  the  parser-defined  location 
'CHAR.'  (global)  on  a PDP-11  or  'CHAR.'  (page  zero)  on  a NOVA. 

It  succeeds  for  all  characters  except  EOF  (14)  or  rubout  (377). 

4.3.7  SUBX.  = 7 

This  code  asks  the  parser  to  match  a subexpression,  i.e.  to 
attempt  to  match  a structure  defined  by  a subsidiary  tr2insition  table. 
The  type  entry  is  followed  by  a single  word  trailing  argument 
containing  the  address  of  the  head  state  of  the  subsidiary  table. 

This  subexpression  state  table  must  contain  a state  with  the  type 
'ENDX.'  as  the  success  exit  to  return  to  the  main  table. 

4.3.8  ENDX.  = 10 

This  is  the  success  exit  from  a subexpression.  It  causes  the 
parser  to  carry  out  the  success  action  and  target  in  the  transition 
which  called  the  subexpression.  It  has  no  action  or  target  entries 
of  its  own  in  the  subsidiary  transition  table. 

4.3.9  END.  = 0 

This  type  indicates  successful  parse  and  must  be  present  in  the 
transition  table.  It  cannot  be  included  in  a subsidiary  transition 
table  for  a subexpression.  It  causes  the  parser  to  return  control 
to  the  user's  program  with  the  success  code  in  R0  or  AC0.  It  is 

the  only  exit  for  a successful  parse.  It  has  no  action  or  target 
entries  associated  with  it. 
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5.  CONCLUSION 

It  has  been  found  in  practice  that  programs  can  be  written  and  debugged  more 
easily  and  faster  using  the  structure  described,  compared  with  individual 
algorithms  for  text  decoding.  The  routines  described  and  listed  in  the  appendices 
have  been  used  and  found  to  match  their  descriptions.  It  is  recommended  that 
consideration  should  be  given  to  using  them  in  any  text  or  command  handling 
routines,  or  to  implementing  a similar  structure  for  special  purposes.  Their 
use  can  save  considerable  program  development  time. 
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APPENDIX  I 


BACKUS-NAUR  FORM  (BACKUS  NORMAL  FORM) 


BNF  is  a formalism  for  describing  languages  which  consists  of  a set  of  definition 
statements.  It  contains  two  metalinguistic  connectives  which  cannot  be  part  of 
the  language  being  described.  These  are 

"=  meaning  "is  defined  by";  it  is  a single  symbol 

I the  logical  OR  operator 

The  other  components  of  the  formalism  are  metalinguistic  variables,  being  sequences 
of  characters  enclosed  in  angle  brackets  <>  having  symbolic  meaning,  and  objects 
or  marks  which  are  basic  undefinable  elements  of  the  language  being  described. 
Juxtaposition  of  marks  and/or  variables  in  a formula  signifies  juxtaposition  of 
the  sequences  denoted.  Statements  in  BNF  consist  of  concatenation  of  meta- 
linguistic variables,  metalinguistic  connectives  and  objects  in  the  form 

<metacomponent>  ••=  <variable>  <variable>  


For  example,  the  statement 

<^perator>  ••=  + 

is  a valid  BNF  definition.  The  variable  <bperator>  is  defined  as  the  symbol  + 
which  is  an  object  ("plus  sign"  - it  cannot  itself  be  defined). 

Similarly 

<bperator>  ••=  - 

<bperator>  "=  * 

<jperator>  "=  / 

define  alternative  meanings  of  the  variable  or  meta-component  <<)perator>. 

This  definition  is  made  more  convenient  by  using  the  OR  operator 

<Dperator>  ••=  +1  -I  *1  / 

Similarly 

<digit>  "=  li  2i  3i  4i  5i  6i  7i  8i  9i  0 

Then  more  powerful  meta-components  are  built  up  by  combining  objects  and  meta- 
components thus : 

<integer>  ••=  <<iigit>  I <5digit>  <j.nteger> 

<sign>  "=  +1  - 

<fractional  part>  -=  <<iigit>  1 <fractional  part>  •<iigit> 

■'^signed  number>  "=  <Lnteger>  I •^nteger>  .<fractional  part> 

•^umber>  "=  <«igiv>  <unsigned  number>  I Consigned  nun4)er> 

The  meta-components  ‘^ull>  and  <«mpty>  signify  the  null  set  of  symbols.  In  a 
practical  line-structured  input  they  would  correspond  to  end  of  line. 


The  same  example  is  coded  below  for  both  PDP-11  and  NOVA  computers.  The 
PDP-11  program  uses  RSX-llD  operating  system  directives  for  input  and  output, 
and  the  NOVA  program  uses  RTOS  directives;  however  both  should  still  be 
comprehensible  without  understanding  these  directives. 

The  example  decodes  a command  string  typical  of  an  RSX  system  command  (it 
requests  creation  of  a User  File  Directory  entry  on  a storage  unit) . The  command 
has  a mandatory  format  specifying  a device  and  a user  code 

UFD_XX:[N,N] 

where  represents  space,  X represents  letters  and  N digits. 

A number  of  options  can  be  included,  and  these  are  shown  in  brackets 

UFD_XX(N):(XX )lN(NN ),  N(NN )]  (/PRO  = [R,R,R,R]) 

(/ALLOC  = NN ) 

where  R represents  any  combination  of  <^ull>  I Rl  Wl  El  D 
Thus  a legitimate  command  incorporating  all  options  would  be 

UFD DK1;SCRATCH  [200,200]  /PRO  = [RWED,  RW,,R]  /ALLOC  = 300 

Action  routines  are  not  shown  in  the  examples  but  would  be  assembly  language 
subprograms  to  cause  the  system  executive  to  carry  out  the  operations  specified 
by  the  command. 

The  finite-state  grammar,  in  BNF  using  pseudo-objects  available  in  the  parser, 
follows : 

<:ommand>  -=  UFD  •^ext  1> 

<5iext  1>  "=  <letter>  <iiext  2> 

■^ext  2>  "=  <letter>  <5\ext  3> 

<^ext  3>  "=  <^umber>  <<levice  term.>  I <iievice  term.> 

<device  term.>  "=  : <^ext  4> 

<hext  4>  '•'■=  <word>  <^ext  5>  I <5iext  5> 

<^ext  5>  -=  <^ic>  <opts> 

<3pts>  ••=  <hull>  I /<^ext  6> 

<hext  6>  ::=  ALLOC  = <alc>  I PRO  = <pro> 

<a|  c>  *=^umber>  <bpts> 

<^ro>  -=  I <^est  pro> 

^rest  pro>  ".=  ]<>pts>  I ,<i‘est  pro>  I R <xest  pro>  I W <rest  pro>  I 

E <i-est  pro>  I D ^rest  pro> 

Here  <iic>  has  been  used  as  a pseudo-object  for  illustration.  It  must  be 
defined  in  a subsidiary  definition. 

<uic>  "=  Kil> 

<ul>  "=  •^umber>  <5j2> 

<u2>  ::=  ,<u3> 

<u3>  "=  <humber>  <i4> 

<u4>  "=  ]<hull> 

These  definitions  are  translated  directly  into  transition  tables  in  the  examples 
whose  listings  follow 


Ik 
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TCJIf'  - TEST  PrtPSEI.' 

1 

2 

3 

4 

5 

6 


MiiCRO  61013  24-(*UC-Tr  12i30  PftCE  1 

TITLE  TESTP  - TEST  PftRSEK 

iC  CRinSLE  30/9/7S 

;nOOIFIEO  TO  hem  type  symbols  26/1/77 

; HILL  LINK  TO  PARSER . OBJ  AT  TKB 


8 

OOCOOO 

END  >0 

9 

ooooot 

ALPH  >1 

10 

000002 

HUNB  >2 

1 1 

000003 

KEY . «3 

12 

000004 

EOF  *4 

1 3 

000003 

HTCH  -3 

: 4 

000006 

ANY  *6 

13 

000007 

SUBX  >7 

1 6 

OODOlO 

ENDX .-lO 

1 • 

:& 

1^ 

20 

nCALL 

EXITSS  DIRS  QlOUl 

000000 

TESTPi 

D1R9 

• PRMPT  MNOICATE  READY 

21 

000006 

103774 

BCS 

TESTP 

2 2 

COoO 1C 

DIftI 

tCML  JCET  COMMAND 

'*  3 

0000 1 6 

103770 

BCS 

TESTP 

2 4 

C00020 

02f  72  7 

000240 

OOOOOOC 

CtlP 

lOSTjflE  EOF  ;IS  IT  CTRL-2'> 

; 5 

0 OOVfcto 

001441 

BCO 

EXIT  JTHEN  FINISH 

2 to 

000050 

011  70  1 

000272 

MOV 

I0ST*2pR1 

A < 

000034 

06770  1 

0002'0' 

ADD 

tCMC/Fl  IPOINT  TO  LAST  BYTE 

20 

OOOC40 

1 1C71  1 

00022  1 

MOVB 

I05T4i2(Rl>  ;STORE  TERMINATING 

2« 

00r044 

012701 

000270' 

MOV 

•CMB/Pl  ;PREPARE  TO  PARSE 

30 

000050 

012 702 

000412' 

MOV 

•START . R2 

31 

000034 

004767 

OOOOOOC 

OSR 

PC.PARSE  ;PARSE 

32 

0 00  0 toO 

oor  700 

TST 

RO  ;COOD  PARSE'> 

33 

C00062 

001  41  6 

BEQ 

SUX  ;YES  - INDICATE 

34 

000064 

010167 

000166 

MOV 

R 1 p ERP^Q . I OPL  ;N0  - PRINT  UHY 

33 

000070 

1 6270  1 

000270' 

SUB 

•CMBjRI  )SET  BYTE  COUNT 

3to 

000074 

160167 

000166 

sue 

Rl.  I0ST42 

3 r 

000 1 00 

016767 

000162 

000132 

MOV 

I0ST>2/ERP4O  I0PL42 

3 1; 

OOn 1 Ob 

DXRI 

• ERP 

30 

000  1 14 

000167 

177660 

JMP 

TESTP 

40 

000 1 20 

SUX: 

DIRS 

• SUXMES 

4 1 

000 1 2to 

0001*7 

177646 

J MP 

TESTP 

42 

OUO 1 32 

EXIT.' 

EXXTtS 

43 

000 1 40 

PRHPT: 

QlOUl 

10  UVBp3p24. p p P <PRPT . 4. 44> 

44 

000162 

124 

123 

124 

PRPT  ! 

.ASCII' 

TST>' 

000U3 

076 

43 

000 1 66 

CMLi 

QlOUl 

I0.RLB.Sp24pp  IOSTp  p<CHBp82  .p40> 

46 

000210 

SUXNES 1 

OlOUl 

10. yVBpSp24p  p pp  <SMSp  B p 40> 

47 

000232 

123 

123 

103 

sns: 

. ASCII ' SUCCESS  1 * 

000233 

103 

103 

123 

000240 

123 

04  1 

48 

C00242 

ERPi 

QIOHI 

10  HV6.S.24...,<CnB.40  .40> 

49 

000264 

000000 

'100000 

lOSTs 

WORD 

0,0 

SO 

000270 

CHB: 

BLKB 

82. 
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rcsTF  - ;es7  ptin^ef 

NACRO 

01013  24-AUC-77  12^50  PACE  2 

s: 

i STATE 

TABLE  FOR  COtlRAHO 

33 

; UFD  DKO :S 1 XNAHC  100. 1 00  )/PRO«C RUED« RUEDp RWED4 RUED 3/ALL0C-300 

94 

; 

95  000412 

000003 

START: 

KEY  . 

96  000414 

125 

106 

104 

ASCIZ'UFD  ' 

000417 

040 

000 

57 

EVEN 

000-!22 

ooooooc 

NULL  . 

59  000424 

000430' 

Nt 

(0  000426 

1 77777 

- 1 

t\  000430 

000006 

HI  1 

ANY 

62  000432 

oouococ 

NULL 

63  000434 

00044  0 ' 

N2 

64  0004  ib 

1 77  77  7 

-1 

65  000440 

000006 

N2: 

ANY 

66  000442 

OOOOOOC 

HULL  . 

67  000444 

000450 ' 

N3 

6d  000446 

1 77777 

-1 

63  00C430 

000002 

N3 ; 

NUHB  . 

70  000452 

ooooooc 

HULL 

71  000454 

000466 ' 

DEVI 

72  000456 

C00005 

niCH . 

73  U004C0 

ooooooc 

MULL 

74  000462 

000466 ' 

DEV  1 

75  000464 

1 77777 

- 1 

76  000466 

000003 

DEVI  i 

KEY 

77  000470 

07  2 

000 

BYTE  ' I . 0 

7S  000472 

OOcOOOC 

HULL 

79  000474 

000500 ' 

N4 

80  000476 

177777 

- I 

31  000500 

OOoOO 1 

N4  1 

ALPH 

£2  OCC-502 

000406 

406 

83  000504 

OOOOOOC 

NULL  . 

84  0005C6 

00i'5l2' 

H5 

85  0005:0 

1 77777 

- 1 

86  000512 

000007 

N5  ) 

SUBX  . 

37  0005  14 

000704' 

UIC 

38  00C5I6 

OOUOCOC 

NULL 

89  oni.5^0 

001-524' 

OPTS 

90  000522 

1 77777 

- 1 

91  000524 

006004 

OPTS  : 

EOF 

92  000526 

OOOOOOC 

NULL  . 

93  000520 

006764' 

ENOIT 

94  C0C532 

000003 

KEY 

95  0C0534 

057 

000 

BYTE  '/#0 

96  00C536 

ooooooc 

NULL 

97  000540 

00i>544  ' 

N6 

98  000542 

1 77777 

-1 

99  000544 

000003 

H6! 

KEY 

100  000546 

101 

114 

114 

ASCIZ'ALLOC*' 

00C'5l 

117 

103 

075 

000554 

000 

101 

EVEN 

IC2  00C556 

onooooc 

HULL  . 

103  000560 

006600 ' 

ALC 

104  000562 

000003 

KEY 

1C3  000564 

120 

122 

117 

ASCIZ'PRO*' 

13 
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TESTP  - 

TEST  P4PSEP 

riACRO  0101  3 24-AUG-77 

12:50 

PAGE  2-1 

000567 

075 

000 

106 

. EVEN 

lor 

000572 

OOOOOOG 

NULL 

103 

000574 

000610' 

PRO 

103 

00C576 

177777 

-1 

1 1 0 

000600 

000002 

ALCi 

Nune 

1 1 1 

000602 

OOOOOOG 

NULL  . 

112 

000604 

OOC  524  ' 

OPTS 

113 

000606 

1 77777 

- 1 

114 

000610 

000003 

PROi 

K EV  . 

115 

000612 

133 

000 

BYTE 

't,0 

116 

000614 

OOOOOOG 

MULL 

1 1 7 

C0C616 

000022 ' 

SPRO 

1 1 8 

OOC620 

1 77777 

- 1 

1 1 3 

00C6J2 

000003 

SPRO  1 

KEY 

120 

00C624 

135 

000 

BYTE 

' 3-0 

12  1 

ooc-toce 

OOOOOOG 

NULL 

122 

000630 

000524  • 

OPTS 

123 

000632 

000003 

KEY 

124 

000634 

054 

000 

. BYTE 

' , , 0 

125 

000636 

OOOOOOG 

MULL 

126 

000640 

000G22  ' 

SPRO 

127 

000642 

OCOOC3 

KEY 

128 

000644 

122 

000 

BYTE 

' R.  0 

123 

000t46 

OOOOOOG 

NULL  . 

130 

000650 

000622- 

SPPO 

13  1 

0 0 Ci  6 5 2 

000003 

KEY  , 

132 

0 G C*  6 5 4 

127 

000 

BYTE 

' U.  0 

133 

000656 

OOOOOOG 

MULL 

134 

000660 

000622 ' 

SPRO 

135 

000662 

000003 

KEY 

136 

000t>64 

105 

000 

BYTE 

'E-0 

137 

0 0 0 » C 6 

OOOOOOG 

NULL 

138 

0 0 0 b ; 0 

000022 ' 

SPRO 

•33 

0 00«-  72 

000003 

KEY  . 

MO 

0 r-  V » • 7 4 

104 

000 

BYTE 

' D,  0 

Ml 

000676 

OOOOOOG 

NULL 

M2 

0 0 0 7 00 

Q0o022 ■ 

SPRO 

143 

(iuoru2 

1 7 ,- 77  7 

- 1 

M4 

GOT'  704 

000003 

UIC< 

KEY 

145 

n 0 C'  7 0 b 

133 

000 

BYTE 

' [ , 0 

Mb 

000710 

OOOOOOG 

NULL  . 

M7 

000^12 

00071  6 ' 

U1 

M8 

000714 

1 77777 

-1 

143 

000716 

000002 

(111 

Hune . 

150 

000720 

OOOOOOG 

NULL  . 

151 

000722 

000726 ' 

U2 

152 

000724 

177777 

“ 1 

153 

000726 

000003 

U2  1 

KEY 

154 

000730 

054 

000 

BYTE 

' , , 0 

155 

000732 

OOOOOOG 

NULL  . 

156 

000734 

000740' 

U3 

157 

000736 

1 77777 

-1 

158 

000740 

000002 

U3  : 

Nune . 

159 

000742 

OOOOOOG 

NULL  . 

160 

000744 

000750  ' 

U4 

161 

000746 

1 77777 

- 1 

»IRE-TR-1842(A) 
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rESTP  - TEST  P«RSEr  MACRO  01013  24-AUC-77  11:50  PACE  2-2 

1b2  000:50 
1b3  00C7S2 
ti'4  000754 
US  00073b 
Ibb  000760 

167  000762 

168  000764 

169 

170 


! 

i 

i 

! 

i 

I 

1 

i 

I 


000003 

U4  : 

tCEV 

135  000 

OOOOOOC 

000762' 

1 77777 

. BYTE 
MULL 

US 

-1 

' 1,0 

000010 

US  : 

EHOX 

000000 

ENDITi 

END 

000000' 

. END 

TESTP 

I 

f 


i 


TESTT  - TEST  PARSES  NACRO  01013  24-AUC-77  12:50  PACE 

SYUSGE  r„ELE 


acCKorttocttoeoe 

oooooooooo 

oooooooooo 

oooooooooo 


ui  Cl  qc  ia 

r « o 

X V>  u mm 

^Ui^^^tMnmmmm 

v>^300000«»4» 


oc  tt  tt  oe 

oo»-o«:«forM-»of^ 

OO0OO(MsA^O«« 

OOOOOOOOOO 

oooooooooo 

oooooooooo 

« N I N n N 

X o ^ Qc  m 

u a.  &.  (o  ^ 
ooooo  ooex 

■ ■ c a ^ o o 

OOOOOOCAOOO 


ttttoroecx  wttot 
00*>i^^  *00(M(^0 

•*>o  — ♦fAi  • 

'T  • •«n6^00 

00000*00000 
ooooo  » ooooo 
ooooo • ooooo 


UJ  a Ui 

t/f  o>  a.  H*  o o 

fK  X o a.  »>•  M 
nYinsoa.<xttoe0e 
X zsxoa.(^^a.oo 


>C  X X X 

o o a c9 

a:  ac  oc  « 

fM*v*  •mtnvfMOO 

tA  » sS  * *oo*ow^ 

— ♦IN'*  *00*0^^ 

0*c*  *00*000 
0*0*  *00*000 
0*c*  *00*000 


^uj^oex  x^o 

-«  . i/>  :-  o -I  X 

XUJ0OOUi*~O0-«f^ 

W*-t»«A««M<,£XXXXX 


oc  Qt;  « oc  a:  a: 

o*«<*o%*sft<roo*<N 
ooor.  .*v*vr<^oc^ 
W r>  o I'l  •«  t f-  O O O <M 

o c O H;  i'>  O O O O O 
OOOOOO  OOOOO 

oocoooooooo 


oa.><A^>oooi..o. 
->  -I  X Ill  £ x x o 

<x  T : j o I V lu  iA>  ui  lu  uj 


> > 0. 
a « 

o o UJ  to 

X c C Ui 


-I  u o ^ 
U>  <C  Ul  o 
or.  o X N 

0 *<  xX  a.  •• 

01  a;  X <x  o. 

tt  •«  :-  -I  _j 

Ul  > O 111  « 


iO  X u.  o 

oc  X > Ul  o 

UJ  {A  <A  O O 


CD  O 

U)  u>  CO  ' 
o -«  m ■ 


WRE-TR-1842(A) 


16 


FT 


PARSE  - CSS  parser  subroutines  macro  D1013  24'AUC-77  IZiSO  PAGE  1 


1 

2 

3 

4 

5 

6 

7 

8 
9 

10 
I 1 
12 

13 

14 

15 
IS 


TITLE  PARSE  - CSB  PARSER  SUBROUTINES 


A MACRO  parser  POR  TEXT  STRINGS  BASED  LOOSELY  ON  TPARS 
DESIGNED  PRIMARILY  FOR  ANALYSIS  OF  COMMAND-LIKE  STRUCTURES 
BUT  INTENDED  TO  BE  EXTENDABLE  FOR  ANY  GRAMMATICAL  ANALYSIS. 


WRITTEN  BY  G S BRIMBLE 
MODIFIED: 

26/1777  SYNTACTIC 

WITH  NOVA 


28  SEPT  1976 


TYPE  SYMBOLS  CHANGED  FOR  UNIFORMITY 
PROGRAM. 


I ■ 


•1 

i 


! 


:i 

f) 


1 


i 
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P.^RSr  - CS5  I»RPS£P  fyeROUriKES  macro  P1013  i4-AUC-77  12:50  PAGE  2 

iNSfEUCTIOK?  FCR  iJ:£ 


IS 
1 •) 
20 
21 
22 

23 

24 

25 
2C 

2 7 
28 

29 

30 

31 

3 2 
3 3 

34 

35 
3b 
37 

3 3 
3 '' 

4 0 
4 : 
4 2 
4 3 
•1  4 
45 

4 b 
4 7 
43 

49 

50 

51 


sbttl  instructions  for  use. 

; TO  CAUSE  ANALYSIS  TO  8E  CARRIED  OUT, A SUBROUTINE  CALL 
; JSR  PC, PARSE 

; IS  USED  REGISTER  fiO  UILL  CONTAIN  THE  RESULT  OF  THE  PARSE 
; ON  RETURN  - R0*0  IF  SUCCESSFUL,  »-2  IF  FAILED 
; REGISTERS  R3.P4,r;,R6  ARE  UNAFFECTED  BY  THE  PARSE 
; Rl  MUST  POINT  TO  THE  TEXT  STRING  TO  BE  ANALYSED. IF  THE 
; PARSE  FAILED, Rl  UILL  EE  POINTING  TO  THE  BYTECS) 

; WHICH  CAUSED  THE  FAILURE 

; R2  HIST  POINT  TO  THE  ENTRY  STATE  OF  A STATE  TABLE 
; WHICH  DEFINES  THE  STRUCTURE  WHICH  THE  TEXT  MUST  HATCH. 

; THE  STATE  TABLE  CONSISTS  OF  A SET  OF  TRANSITIONS 
: WHICH  ARE  SO  ORDERED  AS  TO  DEFINE  THE  ALLOWABLE  STRUCTURE 
; OF  THE  TEXT. 

; THE  TRANSITIONS  CONSIST  OF  THREE  ENTRIES: 

; SYNTACTIC  TYPE  (PLUS  ANY  TPAILIHC  ARGUMENTS) 

; ACTION  SUBROUTINE  POINTER 

: TARGET  STATE  POINTER 

; THE  ACTION  SUBPOUTINES  ARE  EXECUTED  IF  THE  TRANSITION 
; SUCCEEDS,!  E.  THE  SYNTACTIC  TYPE  MATCHES  THEY  ARE 
; CALLED  FROM  THE  PARSER  BY  'JSP  PC,0POINTEP'  SO  MUST 
; RETURN  TO  THE  PARSER  B ) 'RTS  PC'  REGISTERS  R1.R2,R6  MUST  BE 
; PRESERVED  BY  THE  SUBROUTINES 

; IF  MO  ACTION  IS  NEEDED  WHEN  A TRANSITION  IS  TAKEN, 

; A DO  NOTHING'  SUBROUTINE  IS  PROVIDED  BY  THE  PARSER 
; CALLED  'NULL  ' THIS  IS  SPECIFIED  AS  THE  ACTION 
; SUBROUTINE  POINTER  FOR  NO  ACTION 

; THE  TARGET  STATE  POINTER  POINTS  TO  THE  TRANSITION 
I TO  BE  ATTEMPTED  IF  THE  CURRENT  ONE  SUCCEEDS  IF  THIS 
; ONE  FAILS, THE  FOLLOWING  ONE  IS  ATT E MP TED . THE  END  OF  A 
; STATE  IS  INDICATED  BY  A PSEUDO-TYPE  -I  IF  THIS  IS 
; ENCOUNTERED  AS  A TYPE  THE  PARSE  FAILS 


WRE-TR-1842(A) 
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PARSE  - CSB  PARSER  SUBROUTINES  RACRO  D1013  24-AUC‘77  12:50  PACE  3 

SYNTACTIC  TYPES 


53 

54 

55 
5S 

57 

58 
5^ 

50 

51 

52 
o 3 
S4 

ss 

s s 

fc7 

sd 

59 

70 

71 

72 

73 

74 

75 

7 S 

77 

78 

79 

80 

8 1 
82 
83 
8 4 

85 

86 
87 
G8 

8 9 

90 

91 

92 

9 3 
9 4 
95 


97 
93 
99 
100 
10  1 
102 

103 

104 

105 
lOo 
107 
lOS 
109 


SBTTL  syntactic  TYPES 

; THE  SYNTACTIC  TYPE  ENTRY  IN  THE  TRANSITION  SLOCK 
; HAY  eE  ONE  OF  THE  FOILOUIHG: 


ALPH  . 


I 

i 

i 

t 


NUHB 


KEY  . 


J EOF. 

; HTCH 

; ANY 

; 

; 

; SUBX 


ENDX 


END 


DEFINED  ALPH  »1 

THIS  MATCHES  ALL  FOLLOWING  ALPHABETIC  CHARACTERS 
UNTIL  A NON-ALPH  CHAP  IS  FOUND  IT  AMOUNTS  TO 
AN  'IGNORE  TEXT'  SPECIFICATION  THE  ENTRY  HAS  A 

trailing  one-word  afcumentjthe  lower  byte  is  a 

MAXIMUM  BYTE  COUNT  TO  LIMIT  THE  AMOUNT  OF  TEXT 

TO  BE  IGNORED. AND  THE  UPPER  BYTE  IS  A FLAG 

-0  IF  SPACES(AND  TABS)  ARE  TO  BE  TREATED  AS  ALPHA. 

■1  IF  SPACES  CAN  BE  USED  AS  TERMINATORS 
THUS  THIS  TYPE  ALWAYS  SUCCEEDS  UNLESS  THE  BYTE 
COUNT  IS  EXCEEDED  NOTE  THAT  THIS  TYPE  SUCCEEDS  IF 
THERE  ARE  NC  ALPHABETIC  CHARS 
DEFINED  NUMB  =2 

THIS  MATCHES  A DECIMAL  INTEGER  WHOSE  VALUE  IS 
STORED  IN  The  parser-defined  LOCATION  'HUM  ' 
CONVERSION  STOPS  AT  FIRST  NON-NUMERIC  CHAR. 

IT  ALWAYS  SUCCEEDS  - IF  HO  N U ME RA L S . NU M . = 0 
KEY  *3 

MATCHES  ONE  OR  MORE  ASCII  CHARS  WHICH  FOLLOW  THE 
TYPE  SPEC  AS  TRAILING  PARAMETERS  THESE  KEY 
CHAPS  MUST  TERMINATE  WITH  A NULL  BYTE  THEY  ARE 
EASILY  DEFINED  USING  THE  ' ASCI2  ' OPERATOR  BUT 
THIS  MUST  BE  FOLLOWED  BY  'EVEN'  TO  RETURN  TO 
WORD  BOUNDARIES 
EOF  . *4 

MATCHES  ESC. ALT  MO D E . FF . RU BO U T . CR < AN D LF  IF  FOLLOWS) 
OH  ENTRY  TO  ACTION  SU BP  0 UT I N E , R 3 HAS  THE  CHAR 
MTCH . «5 

this  causes  an  UNCONDITIONAL  MATCH  - ALWAYS  SUCCEEDS 
IT  FORCES  ACTION  AND/OR  NEW  STATE, 

ANY  -6 

PUTS  NEXT  CHAR  INTO  PA R SE R - D EF 1 N E D LOCATION  'CHAR.' 
SUCCEEDS  UNLESS  NEXT  CHAR  IS  EOF  OR  RUB0UT(377) 

SUBX  =7 

THIS  ASKS  THE  PARSER  TO  MATCH  A S U B - E X P PE S S I ON . 

I E TO  ATTEMPT  TO  MATCH  A STRUCTURE  DEFINED  BY  A 
SUBSIDIARY  STATE  TABLE  THE  TYPE  ENTRY  IS  FOLLOWED 
BY  H TRAILING  hRCUMENT  OF  A SINGLE  WORD  CONTAINING 
THE  ADDRESS  OF  THE  ENTRY  STATE  OF  THE  SUBEXPRESSION. 
THIS  SUBEXPRESSION  STATE  TABLE  MUST  CONTAIN  A 
STATE  WITH  THE  TYPE  'ENDX.'  AS  THE  SUCCESS  EXIT. 

end:-;  =10 

SUCCESS  EXIT  FROM  SUBEXPRESSION  THIS  TYPE  CAUSES 
THE  PARSER  TO  CARRY  OUT  THE  SUCCESS  ACTION  AND 
target  in  the  transition  WHICH  CALLED  THE 
SUBEXPRESSION  IT  HAS  NO  ACTION  OR  TARGET  ENTRIES 
ASSOCIATED  WITH  IT . 

END  =0 

THIS  TYPE  IS  A SPECIAL  CASE  IT  INDICATES  SUCCESSFUL 
PARSE  AND  CAUSES  THE  PhRSER  TO  RETURN  CONTROL  TO 
THE  CALLING  PROGRAM  WITH  THE  SUCCESS  CODE  IN  RO. 

IT  HAS  NO  ACTION  OR  TARGET  ENTRIES 


Li 
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PriRSC  - CSC 

PMRSf  R 

SUBROUTINES  (1ACR0  DlOl 

3 24-AUC- 

77  12:50 

FftCE  4 

CEFINI TIONS 

1 1 1 

. SBTTL 

OEriHITlONS 

1 1 2 

; 

1 1 3 

000000 

END  =0 

1 I 4 

00000 1 

ALPH  =l 

ns 

000002 

NUMB  *2 

\ 1 b 

000003 

KEY  =3 

117 

000004 

EOF . =4 

1 1 d 

000005 

riTCH  s5 

1 1 ') 

000006 

ANY . *6 

120 

000007 

SU8X  «7 

12  1 
12  2 

000010 

EHDX  =10 

123 

124 

SBTTL 

CODE 

1 <.  5 

12b 

000000 

PSECT 

PARS< 

;«AK£  PSECT  TO  AVOID  SYMBOL 

1 2 7 

123 

OOOC'OO 

0 H'  3 4 6 

PARSE’ 

: nov 

R3,  -<  SP  ) 

;SAVE  UORKING  REGS 

129 

000002 

0 10446 

nov 

R4  , -(  SP  ) 

130 

00C'U04 

0 1C546 

MOV 

P5.  -(  SP  ) 

1 3 1 

000 006 

010167 

000060  PARS4G 

: MOV 

R 1 . TXTPTf 

;SAVE  CURRENT  TEXT  PTR 

132 

0000  12 

012200 

nov 

< R2 )♦ « RO 

;CET  TYPE 

133 

000014 

00*  300 

ASL 

RO 

;MAKE  IT  UORD  ADDRESS 

1 3 4 

000016 

000170 

000050' 

J HP 

AVECTKRO) 

;VECTOR  TO  ITS  ROUTINE 

13S 

000022 

004732 

SUXS4 : 

J SR 

PC,  0<  R2  )♦ 

;0N  SUCCESS/DO  ACTION 

136 

000024 

0 1 1 202 

MOV 

( R2  ),  R2 

;S£T  TARGET  POINTER 

1 37 

000026 

000167 

177754 

J HP 

PARStC 

; REPEAT 

1 3 3 

000032 

0 1 1 70  1 

000034  PAYL*: 

MOV 

TXTPTf, R1 

IRESTORE  TEXT  PTR  TO  AS 

139 

000036 

OOf  732 

TST 

e<  R2  > + 

; IGNORE  ACTION 

1 4 0 

C 0 C 0 4 0 

005732 

TST 

9( R2  >4 

;AN0  TARGET 

1 4 ; 

000042 

000 167 

177740 

J MP 

PARSf C 

IREPEAT 

1 *♦ ». 

M 3 

0 0OH.I6 

000 102 

WORD 

X I T * 

1 4 1 

0 Oh  u 5u 

00"  1 1 2 

VECT* : 

WORD 

END* 

: 4 

coon  52 

0 0r'»0  2 

U 0 R D 

ALPH* 

14.. 

OOf  a **.4 

0 00  16  0 

UOP  D 

HUM* 

1 ■;  • 

0C.i.05i. 

0 Jl  . . 4 

UOPD 

KEY* 

140 

0 0 0 f.  6 0 

001  124 

U 0 R D 

EOF* 

149 

000062 

00"  1 20 

UQRD 

MTCH* 

150 

009064 

000200 

UOPD 

ANY* 

15  1 

000066 

0Q(  -no ' 

UOPD 

SUBX* 

15. 

000070 

000524 

UQRD 

ENDXf 

154 

0 000  72 

ocooco 

TXTPT$ 

: UOPD 

0 

155 

000074 

000 000 

CHAR  : 

UORD 

0 

15e 

000076 

000000 

HUM. : ! 

UORD 

0 

157 

OOC  00 

000207 

NULL  : 

: RTS 

PC 

WRE-TR-1842(A) 
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aS8  PAS'vLi-*-  ^^USROUT  I res  MmCRO  01013 


M-AUG‘77  1 2:30  PmGC  3 


ROUTINES  TO  MATCH  SYNTACTIC  TYPES 


1 » • 

0 tH‘  1 m2 

0 1.  uC  ’i 

X I T$  ; 

MOV 

< SP )♦ . R5 

IRESTORE  REGS 

16> 

OOC 1 U4 

0 l.t04 

MOV 

^SP  >*  > R4 

1 i 3 

DOO 1 Ob 

C 1 2 6 0 3 

nov 

< SP  )♦ i R3 

1»S4 

OOCl  10 

000207 

PTS 

PC 

1 » 5 

1 ^ o 

0001  12 

005000 

ENO$  : 

CLR 

RO 

;FLAC  SUCCESS 

U7 

000  1 14 

0 00  16  7 

177762 

J riP 

X IT$ 

1 p ■-* 
t ■*  r» 

ooo : 20 

OC-C  U7 

177676 

I1TCH$: 

J NP 

suxs$ 

1 < 'J 

1 r ; 

OOf 1 24 

o 

' -j 

o 

000164' 

EOF*  : 

M 0 V 

•40$, R3 

IPOINT  TO  EOF  TABLE 

1 72 

000 1 30 

121123 

3Qfi 

CMP  8 

( Rl ), L R3  >♦ 

173 

OlJU  •;  3 2 

0 0 ; 4 0 5 

o EO 

1 0$ 

;DOES  IT  MATCH*’ 

174 

OOi. ; 34 

0 2t.32  2 

0001 76 ■ 

CMP 

R3, «50l 

;N0  - END  OF  TABLE*’ 

17^ 

000140 

1 c • 7 : 3 

BIOS 

30$ 

;H0  - TRY  hHOTHER 

1 

'lO'i  1 42 

OC.  16  7 

177664 

J MP 

FAYL$ 

;YES  - HO  MATCH  SO  FAIL 

177 

'.•Of.  1 4 b 

oo'-  ;c  1 

10$: 

INC 

P 1 

JPUSH  POINTER  IF  MATCHES 

irs 

000 1 50 

1 2:  i:r 

00001  2 

CMPB 

< R1  1 12 

;FOLLOUtD  BY  LF? 

1 7 '3 

0 0 0 1 54 

ou: 00 1 

SHE 

20$ 

IS  J 

000158 

0 0!  20  1 

INC 

R1 

;IF  SO,  PUSH  TEXT  PTR 

U 1 

0 00  1 60 

00m  167 

177636 

20$: 

J MP 

suxs$ 

ITHEN  SUCCEED 

132 

OOC 1 84 

OOmOI  5 

40$: 

UORD 

IS 

1 S 3 

000 1 86 

00(-377 

WORD 

377 

ie4 

OCO 1 70 

OOm 1 76 

UOKD 

1 76 

ICS 

noo : 72 

000033 

UORO 

33 

1 s b 

OOl 1 74 

0OMO32 

UORD 

32 

167 

000178 

OOOOl  4 

50$: 

UORD 

1 4 

168 

; 

IC'^ 

000200 

121127 

000377 

ANY$  : 

CMPB 

( R1  )*  •377 

;IF  PUBOUT 

1?C 

OCC204 

001411 

BEO 

1 1$ 

Ilf : 

DuiirOo 

121  12  7 

000032 

CMPB 

< R1  ),  *32 

;0R  END  OF  FILE. 

l?2 

000  2 1 2 

OOI  406 

8E0 

1 1$ 

;THIS  FAILS 

15-3 

0 00  2 1 4 

00506  7 

177634 

CLP 

CHAR 

lOTHERUlSE  CLEAR  UORO  TO 

: C'  4 

MOO J 20 

n 2 ! .:  7 

1 77650 

no;B 

t R1  }♦ , CHAR 

;STORE  BYTE 

1 ?*> 

000224 

OOiMt  7 

177572 

JMP 

SUXS$ 

;and  succeed 

13b 

OCO.  30 

0 Cl'  16  7 

177376 

1 1 $ I 

J MP 

FAYL$ 

l ''0 

0 00^-34 

1 O'  7 12 

KE  Yf  : 

TSTfl 

< P2  > 

;ENO  or  KEYS*’ 

133 

0 00  2 36 

001412 

BEG 

1 2$ 

;THEN  SUCCEED 

2C0 

C CO  2 40 

1 22  w2 

CMPB 

( Rl  )♦  . < R2  )♦ 

;MATCH7 

20  1 

U 00242 

OO: 774 

BED 

KEV$ 

;YES  - 00  ANOTHER 

2C2 

000244 

1 05  72  2 

22$: 

TST8 

( R2  )♦ 

IFAILED  SO  PUSH  PTR  OVER 

203 

OCO*  46 

OOI 376 

BNE 

22$ 

;UNTIL  ZERO  IS  FOUND 

204 

0 00 . !)0 

005202 

INC 

R2 

;Al1CN  to  UORD  BOUNDARY 

205 

0 r.  C 2 5 2 

c 06 00 2 

ROR 

R2 

20 

000254 

00f24  1 

CLC 

20  7 

0QG25b 

00.  102 

ROL 

R2 

208 

0 0 r 2 6 0 

000167 

177546 

JMP 

FAYL$ 

20-> 

0 0 0 2 1/  4 

0 0!  20  2 

12$: 

INC 

R2 

ISUCCESS  - ALIGN  TO  UORO 

210 

0 00  2 C t 

OC’  2C2 

I NC 

R2 

21  1 

000270 

OOt  00  2 

ROR 

R2 

212 

000272 

00024  1 

CLC 

21  3 

000274 

0 06  102 

ROL 

R2 

214 

000276 

000167 

177520 

JMP 

suxs$ 

21 
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C:,;c 

r 6 R £ F 

SOCROUriNE3  MACRO  01C13 

24  'AUC 

-77  12:5C 

PACE 

5-1 

J 1 *.• 

Li  Ml-  .:02 

1 i;  20  3 

ALPHS: 

novc 

< R2  )♦ , R3 

;CET  NO  OF  CHARS 

2\  7 

000304 

1 12204 

HOVB 

l «2  )♦ 7 R4 

;AKD  space  HhC 

213 

000306 

00*’  704 

43«! 

TST 

R4 

;COUMT  SPACES  AS  ALPHA? 

21  ^ 

000310 

001 006 

BNE 

1 3$ 

22  0 

000712 

1 21  127 

000040 

CfIPB 

( R1  >4 140 

lYES  - ANY  SPACES? 

22  1 

000316 

001411 

BEQ 

23$ 

lYES  - CO  AGAIN 

22  2 

000320 

12112  7 

00001  1 

CHPB 

(R1  )4lll 

;TRV  TABS 

223 

000324 

001 406 

CEO 

23$ 

224 

000326 

121127 

000101 

13«i 

CMP8 

( R1  >4  I'A 

; IS  BYTE  < A-> 

225 

000332 

107410 

CLO 

33$ 

;1F  SO  IT  IS  A BREAK  CHAR 

22  b 

000334 

121 127 

000132 

CHPB 

( Rl  >4  I'Z 

MS  IT  > Z'J 

22  2 

000340 

101005 

BKI 

33$ 

MF  SO  BREAK 

22  3 

OOC  3 42 

005201 

23$: 

I NC 

R 1 

IMOVE  TO  NEXT  BYTE 

22? 

000344 

005303 

DEC 

R3 

;OONE  ENOUGH  TEXT? 

230 

000346 

002357 

BCE 

43$ 

INOT  YET  - DO  NEXT  BYTE 

23  : 

000350 

000167 

177456 

J MP 

FAVL$ 

;YES  - FAIL 

2 7 2 

000354 

000167 

177442 

33$: 

J tip 

SUXS$ 

IFOUND  BREAK  SO  SUCCEED. 

2 3 3 
234 

000760 

001.C03 

HUH$: 

CLR 

R3 

ICLEAR  SIGN  FLAG 

23  5 

000362 

005067 

177510 

CLR 

HUM  . 

;AND  SUM  WORD 

23  t 

000  .'66 

121  127 

000053 

CMPB 

( Rl  ).  • ' + 

iPLUS  SIGN^ 

23  2 

000372 

001 404 

BEO 

1 4$ 

2 7 3 

000374 

121 127 

000055 

CHPB 

( Rl 

iflIHUS? 

23'* 

0 C C'  4 0 0 

cc:00i 

BNE 

1 4$ 

;no  sign  so  assume  * 

2-l;l 

000402 

OOf  203 

INC 

R3 

;FLAC  minus 

24  1 

000404 

1 21  127 

000060 

14$: 

CflPB 

( Rl  ).  • '0 

MF  CHAR  < 0 

242 

000410 

1 0742  1 

BLO 

24$ 

24  ; 

000412 

121127 

000071 

CMPB 

< Rl  >»  §'9 

;0R  > 9 

24  4 

0004 16 

101016 

BHI 

24$ 

;THEN  BREAK  AND  FINISH 

245 

0 00 4 20 

112104 

novB 

( Rl  )♦ , R4 

lOTHERUlSE  GET  NUMERAL 

24 

OOi'422 

1 62704 

000060 

SUB 

• ' 0 . R4 

;MAKE  A NUMBER 

24  ' 

OCC  426 

0 1*  70  0 

1 77444 

MOV 

HUM  .RO 

24  8 

Onn  .1  72 

006300 

A5L 

RO 

;nULT  BY  10  THE  HARD  WAY 

24  J 

000  4 74 

OObZOQ 

ASL 

RO 

;S0  THIS  CAN  RUN  ON  PDPll/lO 

250 

0 0 0 4 16 

0bt:067 

1 77434 

ADO 

RO. MUM 

;2*<N0 

♦ <2*<  2*N0  ) ) )■! 0*N0 

2 5 1 

0lifM42 

00*  367 

177430 

ASL 

Nun 

252 

000446 

060467 

1 77424 

ADD 

R4. HUM 

;AOD  IN 

NEU  DIGIT 

2*  3 

000452 

000754 

BR 

1 4$ 

;CET  ANOTHER  DIGIT 

2 ' 4 

imM<  i '-4 

0 0*.  ;'0  3 

24$i 

TST 

P J 

JBREAK  SO  APPLY  SIGN 

255 

00O456 

oni 402 

BEO 

34$ 

25b 

0004b0 

OOt.467 

177412 

NEC 

HUM 

25  7 

000464 

0001o7 

177332 

34$  : 

J MP 

SUXS$ 

25S 

; 

25'' 

0C0470 

010246 

SUBX$  : 

MOV 

R24  -<  SP) 

;SAVE  CURRENT  STATE  PTR 

2c  0 

OOC 4 72 

0 1 1 202 

MOV 

( R2  >.  R2 

;CEr  NEU  ONE 

2i  : 

0004  74 

012767 

000506' 

177344 

MOV 

tSUBXT$ . 

VECT$-2 

;ALTER  ERROR  PTR  IN  TABLE 

2t2 

000502 

000  lb  7 

17730C 

J MP 

PARS$G 

;S0  IF  SUBEX  FAILS  IT  COMES  HERE 

2^3 

000506 

012602 

SUBXT$  : 

MOV 

< SP  >♦ . R2 

;RESTORE  STATE  PTR 

264 

000510 

005722 

TST 

* R2  >♦ 

:SKIP  TRAILING  ARGUMENT 

265 

000512 

012767 

000102* 

177326 

MOV 

•XI T$ . VECT$-2 

IRESTORE  ERROR  POINTER 

26  b 

000520 

000167 

177306 

JMP 

FAYL$ 

;AND  TREAT  AS  NORMAL  TRANS  FAIL 

2«  7 

000524 

012602 

ENOX$  : 

MOV 

( SP )♦ , R2 

ISUBEX  HAS  SUCCEEDED 

263 

000526 

005722 

TST 

( R2  )♦ 

:S0  RESTORE  AS  ABOVE 

269 

000530 

012767 

000102' 

177310 

MOV 

•XI T« . VECT»-2 

270 

000536 

000167 

177260 

JMP 

suxs$ 

;AND  SUCCEED 

271 

; 

272 

000001 

END 

I 
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.TITL  TESTP 

; ILLUSTRATtOM  OF  PARSER  USE. 
t G.  BRIMBLE  A/3/77 
) 

eiieedi  .txtm  i 

; 

; DEFIME  GLOBAL  SYMBOLS  IF  NEEDED  BY  OTHER  TASKS 
1 


. ENT 

CHAR.  .NUH.  . PARSI 

• 

t 

. ENT 

TESTP 

i 

. ZREL 

4- 

00300-030377 

IDNSKi 

377 

JUSER  MUST  DEFINE  THIS  ON  P.  EERO 

00331-033333 

CHAR,  t 

0 

;AND  this  (VID  THE  NEXT  ONE 

00332-933303 

MUM.  : 

0 

00993-033367* 

PARS  It 

PARSE 

i 

.MREL 

09330*  030444 

TESTPt 

LDA 

a.TTIPT 

;OP£N  CH0  TO  TTI 

5 

00031*036917 

. SYSTM 

00302*014390 

. OPEN 

0 

1 

00333*063077 

HALT 

; ERROR 

i 

00304*020444 

LDA 

0.TTOPT 

I OPEN  CHI  TO  TTO 

03035*936017 

. SYSTM 

1 

09006*014091 

. OPEN 

1 

i 

09307' 963077 

HALT 

1 

90310*023444 

LOOPPi 

LOA 

0#PRI1PT 

5 

00011  * 336017 

. SYSTM 

'i  * 

00912*017001 

. WRL 

1 

J INDICATE  READY 

1 

00013*  030775 

JMP 

.-3 

I ERROR  RETUfW 

09914*020444 

LDA 

0.  CML 

000 15*  006017 

* SYSTM 

'j 

00316*  015400 

• RDL 

0 

ir.FT  COMMAND 

/ 

90317*  C.'3775 

JMP 

. -3 

I ERROR 

\ 

1 

00023*  324440 

LDA 

l.CML 

\ 

00021  * 125220 

MOVER 

1 . 1 

IWORD  PTR  TO  TEXT 

00022*039557 

LDA 

2.  STRPT 

1 PREPARE  TO  PARSE 

\ 

00023*  006003- 

JSR 

tPARSI 

;do  it 

\ 

00024*  I3I005 

MOV 

0.  0.  SNR 

;WAS  IT  OKT 

00025*033407 

JMP 

GOODP 

JYES 

j 

09926*  023921 

LDA 

0.TXPT 

; NO -SAY  WHY 

I 

03927*  131 120 

MOV  EL 

0.0 

IBYTE  PTR 

\ 

03039*  036917 

• S'^STM 

\ 

00031*217901 

. ’-'HL 

1 

09332*  223756 

JMP 

LOOPP 

J ERROR 

f 

00033*033755 

JMP 

LOOPP 

00234*029402 

GOODPi 

LDA 

0.  SUXM 

00035*  239773 

f 

JMP 

. -5 

I PRINT  "SUCCESS!" 

1 

00036*039076" 

i 

SUXM: 

.♦1*2 

t 

00337*051525 

.TXT/SU 

1 

03343*041533 

CC 

■> 

1 

9004 1 * 042523 

ES 

30042*  05144  1 

SI 

03943*036490 

<I5>/ 

■? 

03044 'CJS! 12" 

TTIPTi 

. +1*2 

1 

00345*  022124 

- TXT/JT 

i 

00346*  35211  1 

TI 

\ 

03047*  0.33330 

/ 

i 

1 

00353*C-33in2" 

TTOPTi 

• +1*2 

i 

1 
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i 

I 


i 


33aSI'022l24 
fi<J052’a52l  17 

TO 

•TXT/IT 

"I0(»53' 000000 

/ 

03054 •000132" 

PJMPTi 

• ♦ 1*2 

00O55’O52I23 

00056*052075 

T> 

•TXT/TS 

00357*030000 

/ 

00060*  300142" 

CNLt 

. 1*2 

000120 

; 

.BLK  80. 

; TRANSITION  TABLE 
S 


00201*000202*  STRPTi 

START 

00202*030333  STARTt 

KEY. 

00203*052506 

.TXT/UF 

00204*042040  0 

00205*000000  / 

00206*  030477* 

NULL. 

00207*  00021  1 • 

M 1 

00210*177777 

- 1 

00211*000006  Nit 

ANY. 

00212*300477* 

NULL. 

03213*300215* 

N2 

00214* 177777 

- 1 

00215*330006  N2| 

ANY. 

00216*330477* 

NULL. 

00217*000221* 

N3 

00220* 177777 

- 1 

00221*000002  N3t 

NUMB. 

00222*000477* 

NULL. 

03223*000230* 

DEVI 

00224*030005 

MTCH. 

30225*000477  * 

NULL. 

00226*  000230* 

DEVI 

00227* 177777 

- 1 

00230*033003  DEVli 

KEY. 

00231  * 335000 

256.  *"t 

00232*300477* 

NULL. 

30233*000235* 

N4 

00234* 177777 

-1 

00235*000001  N4i 

ALPH. 

00236*  000000 

0 

00237*  000477* 

NULL. 

00243*030242* 

NS 

0024 1 * 177777 

- 1 

00242*330007  NSi 

SUBX. 

00243*000336* 

UIC 

00244*  300477* 

NULL. 

00245*  000247* 

OPTS 

00246* 177777 

- 1 

00247*000304  OPTSi 

EOF. 

00250*000477* 

NULL. 

00251*000366* 

ENDIT 

00252*  000003 

KEY. 

00253*  027400 

256.  •**/ 

00254*000477* 

NULL. 

00255*000255* 

N6 

00256*030003  N6t 

KEY. 

00257*340514 

.TXT/AL 

00260*046117  LO 

i 

i 

i 

] 

\ 

i 

'll 


i 

I 
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r" 


03261 '041475 

C> 

00262' 000000 

/ 

00263'030477' 

MULL. 

00264' 030274' 

ALC 

00265' 000003 

KEY. 

OO266'050I22 

.TXT/PR 

00267 '047475 

0* 

00270' 030000 

/ 

00271 '000477' 

MULL. 

002''2'300300' 

PRO 

00273' 177777 

- 1 

00274' 030002 

ALCt 

MUMB. 

00275' 000477' 

NULL. 

00276' 330247' 

OPTS 

00277'  177777 

- 1 

00300'000003 

PROl 

KEY. 

00301'e5S400 

.TXT/C/ 

00332' 030477' 

MULL. 

00303' 030305' 

SPRO 

00304'  177777 

. 1 

00305' 000003 

SPROl 

KEY. 

00306' 056400 

.TXT/1/ 

00307' 030477' 

MULL. 

003I0'03O247' 

OPTS 

0031 1'030003 

KEY. 

00312 

026000 

.TXT/,/ 

00313 

000477' 

NULL. 

00314 

300305' 

SPRO 

00315 

000003 

KEY. 

00316 

051000 

.TXT/R/ 

00317 

000477' 

NULL. 

00320 

000305' 

SPRO 

00321 

000003 

KEY. 

00322 

053400 

. TXT/W/ 

00323 

033477' 

NULL. 

00324 

030305' 

SPRO 

00325 

030003 

KEY. 

00326 

042400 

.TXT/E/ 

00327 

000477' 

NULL. 

00330 

030305' 

SPRO 

00331 

003303 

KEY. 

00332 

042000 

.TXT/d/ 

00333 

033477 ' 

NULL. 

00334 

003305' 

SPRO 

00335 

177777 

-1 

00336 

000003 

UlCl 

KEY. 

00337 

055400 

.TXT/C/ 

00340 

30fl477' 

NULL. 

00341 

000343' 

Ut 

00342 

177777 

-1 

00343 

030002 

Ull 

NUMB. 

00344 

030477' 

NULL. 

00345 

000347' 

U2 

00346 

177777 

- 1 

01347 

010003 

U2l 

KEY. 

00350 

026000 

.TXT/,/ 

00331 

010477' 

MULL. 

00352 

000354' 

U3 

00353 

177777 

- 1 

03354 

030002 

U3i 

NUMB. 

WRE-TR-1842(A) 
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00355 'oaat??’  NOLL. 

00356*000360'  04 

00357*177777  -1 

00360*000003  04i  KEY. 

00361*056400  .TXT/3/ 

00368*000477*  NOLL. 

00363*030365*  05 

00364*177777  -1 

00365*000010  05:  ENOX. 


00366*000303  ENDITl  END. 

; 

; PARSER  MODOLE 


3 DEFINITIONS 


000020 

STPT-20 

030021 

TXPT-21 

090030 

STACK-30 

000090 

END.  -0 

009001 

ALPH. -1 

090002 

NUMB.-2 

990033 

KEY. -3 

000004 

EOF. -4 

000005 

MTCH.  «5 

090006 

ANY.  -6 

339007 

SUBX.-7 

900010 

ENDX.-I0 

030001 

t 

• 

• TXTM 

1 . 

00367*044021 

i 

PARS El 

STA 

l.TXPT 

JSET  UP  AUTO- INC  PTRS 

00370*014021 

DS2 

TXPT 

IDECR  FOR  AUTO  INC  PTR 

00371*044462 

STA 

1,TSAVE 

00372*  014461 

osz 

TSAVE 

03373*050920 

STA 

8.  STPT 

09374*014023 

DS2 

STPT 

03375*054416 

STA 

3#VECTAB*1  ;set  up  error  retusn 

00376* 126409 

SUB 

1. 1 

00377*044452 

STA 

l.DTEN 

; CLEAR  OLD  BYTE 

00430*  92441 1 

LDA 

1, STKPT 

00401  * 344030 

STA 

1. STACK 

00402*020447 

PARSGt 

LDA 

0,  DTEM 

ISAVE  PRESENT  BYTE 

33403*040447 

STA 

0/DSAVE 

00434*024406 

LDA 

1, VECTAB 

03405*032020 

LDA 

2, 9STPT 

;gf.T  a transition  type 

03436* 14 1090 

MOV 

2/  0 

;IF  -1  PILL  BE  ERROR  CODE 

30407* 133000 

ADD 

1>2 

JMAKE  AN  ADDRESS 

00410*033300 

JMP 

90,2 

••VECTOR  TO  HATCHING  TYPE 

0041  I *030736* 

STKPT  I 

STAK 

00412*900414* 

VECTABi 

• *2 

00413*000000 

0 

00414*030425* 

EMD 

00415*000500* 

ALPH 

09416*003621* 

DNUMB 

00417*000551* 

KEYWD 

09490*000576' 

EOF 

00421  * 330575* 

LAM  DA 

30422*  00061 5* 

AMY 

30423*  000674* 

suax 

00424*  300736* 

EMDX 

27 
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> 

ae42SMflS4e0  chos  sub  a<o  ;successfui.  parse 
00426' 032T6S  JMP  tVECTAB*- 1 ; HETURM  FROM  PARSE 

; 


; NXDAT  GETS  NEXT  DATA  BYTE  IGNORING  SPACES  AND  NULLS 
; RETURNS  BYTE  IN  AC  I 
» 


00427*000000 

0 

00430*054777 

M XDAT  1 

STA 

3»  . - 1 

00431  * 924420 

LDA 

i.dtem 

00432*044416 

STA 

l/LASTD 

00433* 176400 

SUB 

3.  3 

00434*054415 

STA 

3.DTEM 

.'GET,  THEN  CLEAR/ DTEM. 

09435* 125014 

SKP»R 

1 # 1 

I IS  IT  ZERO? 

00436*  00277 1 

JMP 

ANXDAT-  1 

1 ;no-legit.  char 

30437*026321 

LDA 

1, 9TXPT 

JYES-GET  A NEW  WORD/ 

09440*  034000- 

LDA 

3/  IDMSK 

00441* 137400 

AND 

1>3 

IMASK  OFF  LS  BYTE 

00442*  054407 

STA 

3/DTEM 

;AND  SAVE  IT 

00443* 125300 

MOVS 

l<  1 

ISWAP  M£  BYTE  LOW 

00444*034000- 

LDA 

3/ IDMSK 

00445* 167405 

AND 

3/  1/  SNR 

.•MASK  IT  OFF  AND  TEST  FOR  MULL 

00446*  000763 

JMP 

NXDAT»1 

JIF  NULL/ GET  ANOTHER. 

00447*032760 

JMP 

9NXDAT-  1 

1 lELSE  ACI  IS  NEXT  BYTE 

00450*030000 

LASTDt 

0 

t 

i SUCCESS  ROUTINES:  SUXEX(SUXX)RESTORES  THE  LAST  BYTE 
; TAKEN  BY  N XDAT,  SUXS ( SUX)  DOES  NOT.  BOTH  CAUSE 
; EXECUTION  OF  THE  ACTION  ROUTINE  AND  BRANCH  TO  TR/WS. 
; 

; 

S STORAGE  COMMON  TO  NXO  AND  SUX.FALE 
I 


OtlASl '000(300 

DTEMi 

0 

OOA-jP.'  0330110 

DSAVEt 

0 

00A53' 030003 

TSAVEi 

; 

SUXEXi 

0 

00A5A' 03A774 

LDA 

3/LASTD 

1 RESTORE  LAST  BYTE  TAKEN. 

00/(55' 05A774 

STA 

3/DTEM 

00456' 175005 

MOV 

3/  3/  SNR 

00457' 314321 

DS7. 

TXPT 

JIF  DTEM«0/PUSH  THE  PTR  BACK 

33460*  034021 

5UXS: 

LDA 

S/TX^T 

;1F  NEXT  match  FAILS/ 

00461  * 054772 

STA 

3/TSAVE 

; RETURN  TO  HERE. 

00462*036020 

LDA 

3/  9STPT 

00463*005400 

JSR 

0/3 

JDO  ACTION 

00464*322020 

LDA 

4/  flSTPT 

JGET  ADDP.  OF  NEXT  TRANS 

00465*340023 

STA 

O/STPT 

JAND  SET  IT  UP 

00466*014020 

DSZ 

STPT 

; ALLOW  FOR  AUTO  INC 

00467*032531 

JMP 

•PARET 

; RETURN  TO  PARSE 

00470*  034762 

FAYLi 

LDA 

3/ DSAVE 

JNO  MATCH-  RESTORE  TEXT  POINTERS 

30471*054760 

STA 

3/ DTEM 

JTO  AS  UEFO'IE  ATTEMPTED  MATCH. 

00472*034761 

LDA 

3/TSAVE 

33473*354021 

STA 

3/ TXPT 

.00474*  036023 

LDA 

3/  tSTPT 

33475*036020 

LDA 

3/  «*STPT 

J IGNORE  action  AND  TRANS 

00476*032522 

JMP 

9 PARET 

JGO  BACK  TO  PARSE 

I 

1 NULL 

ACTION 

ROUTINE  FOR  OTHER  TASKS 

00477  * 031400 

NULL.  I 

JMP 

0/3 

»|RE-TR-1842(A) 
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1 


» TYPi  MATCHING  ROUTINES  ENTERED  BY  VECTOR  TABLE. 
S 


[ 

1 


1 

I 


I 

r 

I 

I 


00530 

02E020 

ALPH: 

LDA 

0, iSTPT 

IGET  ARGUMENT  i 

00501 

324003- 

LDA 

1.  I OMSK 

j 

00502 

137400 

AND 

0.  1 

IMASK  OFF  BYTE  COUNT  ! 

00503 

044432 

STA 

1/BCNT 

00504 

131300 

MOVS 

0,  0 

i 

) 

00535 

024300- 

LDA 

I. I OMSK 

00506 

123400 

AND 

1.0 

IGET  SPACE  FLAG 

00507 

030404 

JMP 

• +4 

1 

00510 

014425 

STRNGt 

DSZ 

BCNT 

JDONE  BYTE  COUNT? 

0051  1 

030402 

JMP 

. +2 

00512 

003756 

JMP 

FAYL 

;IF  SO/ FAILURE 

00513 

036456 

JSR 

•NXD 

IGET  A BYTE  SAVING  AC0 

00514 

034456 

LDA 

3. A5CA 

005  IS 

136032 

SKPGE 

1/3 

03516 

030404 

JMP 

ENDST 

IFINISH  IF  <A  ' 

03517 

034454 

LDA 

3, ASCZ 

03520 

136432 

SKPGT 

1/3 

IFINISH  IF  >Z  1 

30521 

000767 

JMP 

STRNG 

ICHAR  WAS  ALPH.  SO  DO  ANOTHER 

00522 

131004 

ENDSTt 

MOV 

0/0/ SZR 

lARE  SPACES  ALPHA?  ■ 

03523 

002443 

JMP 

• 5UXX 

1 RESTORE  INMATCHED  BYTE  AND  END  ! 

00524 

034407 

LDA 

3/ ASCSP 

1 YES- IS  THIS  SPACE  OR  TAB? 

03525 

166415 

SKPNE 

3/  1 

< 

1 

03526 

030762 

JMP 

STRNG 

I IF  SO/GET  ANOTHER  ALPH 

00527 

034405 

LDA 

3/ ASCTB 

, 

00530 

166415 

SKPNE 

3/  1 

03531 

330757 

JMP 

STRNG 

T 

00532 

032434 

JMP 

• SUXX 

INOT  ALPHA  SO  SUCCEED 

03533 

330340 

ASCSPi 

40 

' 

00534 

33001  1 

ASCTBt 

1 1 

/ 

00535 

000300 

BCNTi 

0 

03536 

020436 

) 

NXKEYj 

LDA 

0,KTEM 

lALGOR.  SIM  TO  NXD  GETS 

00537 

152400 

SUB 

2/2 

INEXT  KEY  BYTE  IN  AC0. 

00540 

350434 

STA 

2/KTEM 

00541 

131014 

SKPtR 

0/0 

00542 

030404 

JMP 

. +4 

1 MATCH  A BYTE 

30543 

022020 

LDA 

0/  eSTPT 

03544 

040430 

STA 

0/KTEM 

I IF  NULL/GET  AND  SAVE  ANOTHER 

00545 

101300 

MOVS 

0/0 

I SWAP  MS  BYTE  LOW 

03546 

030000- 

LDA 

2/  I OMSK 

I MASK  OFF  THE  NEXT  BYTE 

00547 

143400 

AND 

2/0 

00550 

031400 

JMP 

0/3 

1 CARRY  BACK  IN  AC0 

03551 

034765 

. KEYVD 1 

JSR 

N XKEY 

IGET  A KEY 

00552 

131004 

MOV 

0/  0/S3n 

IIS  IT  END(NULL)? 

03553 

000403 

JMP 

MCHKY 

INO-MATCH  IT 

03554 

043420 

STA 

O/KTEM 

lYES-RESET  STORE 

00555 

332412 

JMP 

• SUX 

I IF  NULL/  SUCCESS  1 

00556 

006413 

MCHKYj 

JSR 

SNXD 

IGET  A DATA  BYTE 

00557 

106415 

SKPNE 

0/  1 

I MATCH? 

00560 

000771 

JMP 

KE'rWD 

lYES-DO  ANOTHER 

00561 

034755 

JSR 

NXKEY 

INO-FAIL  MATCH 

00562 

1 3 1 004 

MOV 

0/0/ SJR 

I IGNORE  KEYS  UNTIL  NULL 

00563 

000776 

JMP 

.-2 

00564 

040410 

STA 

0/KTEM 

I CLEAR  STORE 

00565 

302403 

JMP 

• FALE 

I FAIL  MATCH 

; INDIRECT  pointers  AND 

COMST^tfJTS 

03566' 

030454' 

SUXXt 

SUXEX 

■ 

1- 
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00567' 

000460' 

SUXi 

SUXS 

00570' 

000470' 

FALEt 

FAYL 

00571' 

000430' 

NXDt 

NXDAT 

00572' 

000101 

ASCAt 

"A 

00573' 

000132 

ASCZl 

"Z 

00574' 

000000 

KTCMI 

0 

t 

1 REST 

1 

OF  MATCH 

routines 

00575' 

002772 

t 

LAM DA: 

• 

JMP 

• SUX 

) ALWAYS  SUCCEED 

00576 

006773 

/ 

EOF: 

JSR 

•NXD 

;GET  next  BYTE 

00577 

030410 

LDA 

2»  EOFT 

00600 

151400 

ELOOPt 

INC 

2<2 

1 

00601 

021000 

LDA 

0«0>2 

00602 

100015 

SKPNM 

0<0 

;end  of  possibilitiest 

00603 

002765 

JMP 

9FALE 

;then  fail 

00604 

106415 

SKPNE 

0.  I 

JWAS  IT  A MATCH? 

00605 

002762 

JMP 

• SUX 

JYES 

00606 

000772 

JMP 

ELOOP 

INO 

00607 

000607' 

EOFTi 

• 

00610 

000015 

15 

ICR 

0061  1 

000033 

33 

;esc 

00612 

000176 

176 

;alt  mode 

00613 

000014 

14 

IFF 

00614 

177777 

» 

- 1 

JEND  OF  LIST 

00615 

006754 

• 

ANYi 

JSR 

•NXD 

;GET  a BYTE 

00616 

044001- 

STA 

UCHAR. 

SMD  STORE  IT 

00617 

002750 

t 

JMP 

• SUX 

.•SUCCESSFUL  EXIT. 

00620 

000402* 

PARETt 

t 

PARSG 

00621 

102400 

1 

DMUMBt 

SUB 

0.0 

> BORROWED  FROM  . DBIN 

00622 

040444 

STA 

0.  . EC  1 0 

1 CLEAR  SIGN  WORD 

00623 

040444 

STA 

0>  .ECl  1 

/CLEAR  SUM  WORD 

00624 

306745 

JSR 

• . EC40 

1 GET  A CHARACTER 

00625 

121000 

MOV 

1.0 

00626 

024442 

LDA 

1..EC20 

» TEST  FOR 

00627 

106405 

SUB 

0.  1.  SNR 

00630 

000405 

JMP 

.EC97 

; YES 

00631 

024440 

LDA 

1..EC21 

1 NO.  TEST  FOR 

00632 

106404 

SUB 

0.  l.SZR 

00633 

000404 

JMP 

.EC96 

J MO  EXPLICIT  SIGN 

00634 

010432 

IS7 

.EC  10 

; SET  FLAG  WORD  FOR  NEGATIVE 

; NUMBER 

00635 

006734 

.EC97: 

JSR 

•. EC40 

t GET  another  character 

00636 

121000 

MOV 

1.0 

00637 

024433 

.EC96: 

LDA 

1..EC22 

> ASCII  "0" 

00640 

030433 

LDA 

2.  .EC23 

; ASCII  "9" 

0064  1 

142033 

ADCZ# 

2.  0.  SNC 

; SKIP  IF  > 9 

00642 

106032 

ADCZ# 

0.  l.SZC 

; SKIP  IF  >■  0 

00643 

000406 

JMP 

.EC95 

; NOT  A DIGIT.  THERFORE  A BREAK 

00644 

122400 

SUB 

1.0 

; REDUCE  DIGIT  TO  0-9  BINARY 

00645 

324422 

LDA 

1.  .ECl  1 

; SUM  WORD 

03646 

304412 

JSR 

.EC50 

» MULTIPLY  BY  10  AND  ADD 

30647 

344420 

STA 

1 . . EC  1 1 

J SAVE  SUM 

00650 

000765 

JMP 

.EC97 

1 GET  NEXT  CHARACTER 

00651 

024416 

.EC95t 

LDA 

1. .ECl 1 

i RESULT  TO  ACl 

00652 

125120 

MOVZL 

1.  1 
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30653*014413 

DSZ 

. EC13 

1 

TEST 

SIGN 

00654' 125221 

MOVER 

1<  1<SKP 

t 

POSITIVE 

00655* 124640 

MEG  OR 

1.  1 

s 

NEGATIVE 

00656*044002- 

STA 

1/NUM. 

00657*002707 

JMP 

»SUXX 

• ' 
t 

RESTORE  BREAK  CHAR. 

; R0UTI«)E  TO  MULTIPLY  ACl 

BY  10 

AND  ADD  AC0 

00660* 131120 

.EC50I 

MOVZL 

1,2 

; 

N«2 

00661* 151120 

MOVZL 

2.2 

; 

N*4 

00662* 147000 

ADD 

2,  1 

; 

N*S 

00663* 125120 

MOVZL 

1,  1 

; 

N*5*2 

- N*10 

00664* 137000 

ADO 

a.  1 

t 

ADD  AC0 

00665*001400 

t 

JMP 

0/3 

t 

SUCCESS  RETURN 

00666*000000 

i 

.EClOt 

0 

s 

FLAG 

WORD  FOR  SIGN  OF  RESULT 

00667*000000 

. EC  1 1 1 

0 

i 

RUNNING  SUM  WORD 

00670*030053 

.EC20I 

*'+ 

; 

ASCII 

30671*030055 

.EC21I 

s 

ASCII 

tt.  tt 

00672*300360 

.EC22I 

”0 

i 

ASCII 

M0M 

00673*000071 

. EC23t 

"9 

t 

ASCII 

*•9” 

000571  * 

.EC40>MXD 

$ 

ADDRESS  OF  GET  CHARACTER 

1 

t 

; 

ROUTINE 

00674*  026020 

i 

t 

SUBXi 

LDA 

1/ OSTPT 

SGET  NEW  STATE  PTR 

00675*  030020 

LDA 

2/STPT 

; STACK  CURRENT  STATE  PTR 

00676*052030 

5TA 

2/OSTACK 

00677*  044020 

STA 

1/STPT 

00703*314023 

DSZ 

STPT 

JSET  NEW  STATE  PTR 

00701*326422 

LDA 

1/  0VCTP 

00702*  046030 

STA 

1/ 9STACK 

;SAVE  OLD  ERROR  RETURN 

00703*024417 

LDA 

l/SXTP 

00704*046417 

STA 

1 / #VCTP 

/■NOW  errors  come  here 

00705*002713 

JMP 

•PARET 

;try  to  parse  subex 

00706* 126403 

f 

E^JDXi 

SUB 

1/  1 

JFLAG  SUCCESS 

00707*034033 

SUBXTi 

LDA 

3/ STACK 

00710*021433 

LDA 

0/0/3 

IGET  OLD  ERROR  RETN 

00711*042412 

STA 

0/0VCTP 

00712* 175400 

IMC 

3/3 

00713*031400 

LDA 

2/0/3 

;get  old  state  ptr 

03714*050023 

STA 

2/  STPT 

00715* 175400 

INC 

3/3 

03716*054030 

STA 

3/ STACK 

JTIDY  STACK 

037 17* 125034 

MOV 

1/  I/S7.R 

ITEST  FOR  FAIL 

0n720*O0a650 

JMP 

UFALE 

00721*002646 

JMP 

• SUX 

00722*030707* 

SrTPi 

SUDXT 

00723*030413* 

VCTPi 

VECTAB+I 

900312 

. BLK 

10. 

000736* 

; 

STAK>. 

000030* 

; 

. EMD 

TESTP 

' m 
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ALC 

030274' 

ALPH 

030500' 

ALPH. 

000001 

etiY 

0006IS' 

A'lY. 

000006 

ASCA 

000572' 

ASCSP 

000533' 

ASCTB 

00053A' 

ASCZ 

030573' 

BCNT 

030535' 

CHAR. 

000001- 

CM 

aC030OU 

CML 

030060' 

DD 

030000U 

DEVI 

000230' 

tNUMB 

000621' 

DSAVE 

300A52' 

DTEM 

O00A51 ' 

DZ 

O3OO00U 

EL  OOP 

000600' 

£ND 

0OOA25' 

ENDtT 

000366' 

E^JDST 

000522' 

HUOX 

000706' 

EMDX. 

030010 

END. 

000000 

EOF 

000576' 

EOFT 

000607' 

EOF. 

000004 

FALE 

000570' 

FAYL 

000470' 

GOOD? 

000034' 

IDMSK 

030000- 

KEYWO 

0005SI' 

XEY. 

300003 

HE. 

O00000U 

OTEM 

000574' 

LAMDA 

000575' 

LASTD 

000450' 

LOOPP 

030010' 

MCHKY 

000556' 

MTCH. 

000005 

HI 

000211 ' 

NS 

000215' 

H3 

300221 ' 

MA 

030235' 

H5 

000242' 

N6 

000256' 

HULL. 

000477' 

NUMB. 

000002 

HUM. 

000002- 

H.VD 

000571 ' 

NXDAT 

000430' 

NXKEY 

000536' 

OPTS 

000247' 

or^E 

030000U 

PAHET 

003620' 

PARSE 

000367' 

PAR  so 

000402' 

PARS  I 

000003- 
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PHMPT 

000054* 

PRO 

000300* 

SPRO 

000305* 

STACK 

000030 

STAK 

000736* 

START 

000202’ 

STKPT 

00041 1 * 

STPT 

000020 

STRNG 

000510* 

STRPT 

000201  * 

SU3X 

000  674* 

SUBXT 

000707* 

SUBX. 

000007 

SUX 

000567* 

SUXEX 

000454* 

SUXM 

000036* 

SUXS 

000460* 

S 

UXX 

000566* 

SXTP 

000722* 

TESTP 

000000* 

TSAVE 

000453* 

TTIPT 

000044* 

TTOPT 

000050* 

TXPT 

000021 

U1 

000343* 

U2 

000347* 

U3 

000354* 

U4 

000360* 

US 

000365* 

UIC 

000336* 

ULL. 

O00000U 

VCTP 

000723* 

VECTA 

000412* 

.CCIO 

000666* 

• ECl  1 

000667* 

.ECR0 

000670* 

.EC21 

03067 1 * 

• EC22 

030672* 

.EC23 

000673* 

.EC40 

000571  * 

.EC50 

000660* 

.EC9S 

000631' 

.EC96 

000637* 

.EC97 

030635* 
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APPENDIX  V 
GLOSSARY  OF  TERMS 


BNF: 

FINITE-STATE  GRAMMAR; 

FINITE-STATE  PARSER: 

FORMALISM: 

GRAMMAR: 

META-COMPONENT: 

META -LANGUAGE: 

OBJECT: 


PARSER: 

S 

PHRASE -STRUCTURE  GRAMMAR; 
> 

PSEUDO-OBJECT: 

ii 

REGULAR  GRAMMAR: 
SEMANTICS: 

STATE  TABLE: 

SYNTACTIC  TYPE: 

SYNTAX: 

TRANSITION  TABLE: 


Backus-Naur  Form,  a method  to  describe  languages. 

A set  of  rules  which  define  a language  in  which 
there  are  a finite  number  of  combinations  of 
elements . 

A mechanism  to  determine  whether  input  data  conforms 
with  a given  finite-state  grammar. 

Formal  structure. 

Rules  defining  a language. 

Parts  of  a language  being  defined  in  BNF. 

A language  which  can  be  used  to  describe  languages. 

A mark  or  symbol  which  is  a basic  undefinable  element 
of  the  language  being  described;  in  particular,  a 
string  of  letters  may  become  a basic  symbol 
independent  of  the  individual  letters  from  which  it 
is  composed. 

A mechanism  for  deciding  whether  input  data  conforms 
to  a particular  grammar. 

A set  of  rules  which  define  a language  by  building 
more  complex  structures  from  aggregations  of 
elements  and  other  structures. 

A meta-component  which,  although  built  up  from 
objects,  is  a useful  conceptual  unit  and  can  be 
treated  as  though  it  was  itself  an  object. 

A finite-state  grammar. 

The  meaning  of  constructs  in  a grammar;  their 
significance  in  the  context  in  which  they  are  used. 

A data  table  which  can  direct  a finite-state  parser 
by  specifying  how  the  elements  of  the  language  must 
be  combined. 

An  object  or  pseudo-object. 

The  arrangement  of  elements  of  a language  according 
to  its  grammar. 

State  table 
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1 

: <letter> 

; meta-component 

1 

: '<letter>  ; meta-component 

2 

; target 

2 ; target 

<anything  else> 

; next  meta-component 

-1  ; end  of  state 

5 

; its  target 

2 

* = 

2 

: 

; first  meta-component  of 

3 

3 

; new  state 

-1 

<any thing  else> 

3 

: <letter> 

5 

4 

3 

: <letter> 

-1 

4 

4 

<^perator> 

<anything  else> 

3 

5 

*^ull> 

4 

: <3perator> 

6 

3 

-1 

'^ull> 

6 : 

SUCCESS 

6 

■Anything  elsc> 

5 

5 

; ERROR 

6 

: SUCCESS 

Figure 

1(a) 

Figure  1(b) 

Figure  1.  Evolution  of  list  structure 
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